Greedy Connectivity of Geographically Embedded Graphs 
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We introduce a measure of greedy connecttvity for geographical networks (graphs embedded in 
space) and where the search for connecting paths rehes only on local information, such as a node's 
location and that of its neighbors. Constraints of this type are common in everyday life applications. 
Greedy connectivity accounts also for imperfect transmission across established links and is larger 
the higher the proportion of nodes that can be reached from other nodes with a high probability. 
Greedy connectivity can be used as a criterion for optimal network design. 
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Large, complex graphs, or networks, have been the 
subject of much recent interest due to their ubiquity in 
everyday life and virtually all walks of science [1] . In par- 
ticular, the ability to connect any two nodes by a contin- 
uous path along the graph's edges is crucial to its func- 
tion (transmitting information, controlling the spread of 
disease, etc.) and has been studied at length [1]. 

A graph G(y, E) consists of a set V oi N vertices 
1, 2, . . . , and a set E of edges, or links (i, j), connect- 
ing between the nodes (i and j). The nodes i and j are 
neighbors. Nodes s and t are connected if a continuous 
path of edges {s,Vi),(vi,V2), ■ ■ ■ ,(yi-i,t) can be found 
between the two nodes. In this view, connectivity is a 
global property: a complete knowledge of the graph is 
required to decide which pairs of nodes are connected. 

In this letter we address the question of connectivity 
in a different, yet commonly encountered setting. Con- 
sider, for example, the paradigmatic experiment of the 
social psychologist Stanley Milgram [2], who asked peo- 
ple in Omaha, Nebraska, to deliver a postcard to another 
person in Boston, Massachusetts. The name and address 
of the target person was disclosed, but the participants 
were to deliver the postcards only to people they knew on 
a first-name basis. If they did not know the target, the 
postcard was to be delivered to an acquaintance, who 
would then deliver it onward following the same rules, 
etc. About 20% of the cards reached their target, tak- 
ing an average of 5.5 steps, a result that gave rise to the 
idea of "six degrees of separation" and the small world 
phenomenon [3]. 

Two major ingredients are different in Milgram's ex- 
periment from the usual concept of graph connectivity: 
(1) Connectivity is established from local information 
alone — participants knew little else beyond their own 
acquaintances and had no access to the full net of social 
contacts. (2) The network in question is embedded in 
space, i.e., each node (person) has a well defined loca- 
tion. The decision who to mail the postcard to is clearly 
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influenced by distance from the target. This situation 
is not uncommon: global information is rarely available 
in large complex networks, while geographically embed- 
ded nets include numerous important examples, such as 
routers of the Internet, networks of flight connections, 
the electricity power grid, and neurons in the brain, to 
name a few, and navigation from local information is an 
attractive problem [4-13]. 

Consider then a graph G{V, E) embedded in space, and 
denote the geographical (Euclidean) distance between 
nodes by d{i,j). We wish to establish whether a source 
node s is connected to a target node t, whose location 
is disclosed, relying only on local information. Inspired 
by Kleinberg [11, 12], we model the search for connectiv- 
ity by the greedy algorithm: Make the next step to that 
neighbor that is closest to the target, provided that the dis- 
tance diminishes. Or, symbolically, (s, vi,V2j ■ ■ ■ , t) 
is a greedy path of length £ from s = vq to t = vg if for 
/c = 1,2,...,^ 

d{vk,t) < d{i,t), and d{vk,t) < d{vk-i,t), (1) 

for all the neighbors i ^ Vk of Vk-i- We are assuming 
that the nodes are placed in a continuum so that no two 
pairs of nodes are at the same distance from one another. 
With this understanding, greedy paths are unique. If (1) 
is fulfilled for some £, we say that s and t are greedily 
connected. 

By definition, a greedy path is automatically a path. 
The converse is not true. Many other properties differ- 
entiate between connectivity and greedy connectivity: A 
greedy path is not necessarily reversible — the greedy 
path found from s to t is not always a greedy path from t 
to s; There is no transitivity — if z is greedily connected 
to j and j is greedily connected to k it does not follow 
that i is greedily connected to fc, or in other words, the 
concatenation of greedy paths is not necessarily a greedy 
path; If (s, vi,V2, ■ ■ ■ , vi-i,t) is a greedy path from s to 
t, then (wi, Wi+i, . . . , i) is a greedy path (from Vi to t), 
however, other sub-paths are not always greedy paths — 
e.g., (s, wi, . . . , Vi) might not be a greedy path from s to 
Vi ; Perhaps most surprisingly, adding links to an existing 
network does not necessarily increase greedy connectivity 
and might actually have the opposite effect. 
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FIG. 1: (Color online) Circle-embedded ER graph with A'^ = 
21/ + 1 = 21 nodes and p = 0.2. "Short-ranged" links, to just 
one or two nodes away, are highlighted in a different shade. 

Due to the irreversibility of greedy paths, one cannot 
define a greedily connected component of a graph. In- 
stead, we propose measuring greedy connectivity by the 
quantity 

GC = Y1 ^^ste-'''^' /N{N-1), (2) 

s,t 

where the sum runs over all N{N — 1) pairs of nodes s 
and t (s ^ t), a St = 1 if there exists a greedy path from 
s to t and is otherwise, and ist is the length of the 
greedy path from s to t (when it exists). For /i = 0, 
GC is simply the fraction of all pairs that are greedily 
connected, /i can be thought of as a chemical potential^ 
ov uj = e^'^ can be interpreted as the probability to make 
the transition across a single link successfully. (This is 
important in situations such as the Milgram experiment, 
where a; < 1.) GC{uj) is the actual fraction of successful 
connections between all possible pairs of nodes when the 
transmission probability across each link is w. 

We now turn to some key examples. Consider first an 
Erdos-Renyi (ER) random graph embedded in a circle, 
where each link is realized with probability p (Fig. 1). 
For simplicity, assume that the = 2L -|- 1 nodes are 
equally spaced, = (cos sin 2ii) g R^^ avoid 
degeneracy of greedy paths, we introduce a small random 
perturbation to the location of each node. Alternatively, 
one can work with equal spacing and preserve uniqueness 
by making an arbitrary random choice when more than 
one option for a greedy step becomes available. Distances 
can be measured as either jr^ — r^| or min{|z — j|,iV — 
\i — to the same effect. We opt for the latter. 

Denote by P(£, to) the probability that two nodes, m 



lattice spacings apart, are connected by a greedy path of 
i steps. It obeys the equation 

m — 1 

P(£,to) =pP(^-l,0) + (l-(z2) ^ q^^-^P{t^l,k), (3) 

fc=i 

(q = 1 — p is the probability that a link is absent), with 
boundary condition P(£, 0) = 5ifi. The first term on the 
rhs denotes the event that there is a direct link between 
the target and source (probability p) and the boundary 
condition tells us that the greedy path has then length 
1. The first term implied by the sum refers to the case 
that the direct link is absent (prob. q) but a link to at 
least one of the two nearest neighbors of the target exists 
(prob. 1 — q^)] from there, one needs a greedy path of 
length £ — 1 (since one step has already been taken) to 
the target at distance fc = 1, expressed by the P(£ — 1, k). 
Successive terms model the events that increasingly more 
links to the sites surrounding the target are absent. 
Equation (3) can be solved in standard ways, to yield 

m m 

P^{m) = Y,P{l,m)J =pu\{[l + u{l-q^)q^^-^] . 
1=1 k=2 

(4) 

Finally, using GC{uj) = {l/L)J2^^^P^{m), we get 
GCic.) = p.. 1 + X: n [1 + - q'h'"-'] ) ■ 

\ m=2 k=2 / 

(5) 

It is interesting to note that puj is the greedy connectiv- 
ity that would result if the only greedy path available 
between any two nodes were a direct link (that occurs 
with prob. p). Thus, the remaining factor is the enhance- 
ment to the GC that occurs as a result of other available 
paths, when the direct link is absent. This enhancement 
factor is bounded by e" and achieves its maximum near 
p ^ \/VL. Typical results for the greedy connectivity of 
ER graphs, comparing our theoretical analysis to com- 
puter simulations, are shown in Fig. 2. 

Next, consider circularly embedded Small- World (SW) 
networks [14]. We start with the underlying "lattice" 
configuration, where each of the = 2L + 1 nodes is 
connected to Z-nearest neighbors on either side (Fig. 3a). 
As before, the nodes are slightly perturbed from their 
lattice centers, to avoid degeneracy of greedy paths. The 
equation for P{1, to) reads 

P(^,to) = (6) 

where \x\ is the smallest integer greater or equal to x. 
We then have P„(to) = -P(^> = ^^"^'^ > and 

1 ^ 

GC(a;) = - ^ P^to) 

m=l (7) 

^ / 2 L/l\ l-UJ^'P 

L \ J 1 — cj 
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FIG. 2: (Color online) Greedy connectivity of circularly em- 
bedded ER graph, with L = 100 and p — 0.05, as a function 
of the transmission probability uj. Inset: The network en- 
hancement factor, GC(u})/pu), as a function of p, for ui = 0.5. 




FIG. 3: (Color online) (a) Circularly embedded lattice (left) 
with N = 2L -\- 1 = 21 nodes and I — 2. (b) Circularly em- 
bedded Small- World network (right), obtained by removing a 
fraction e = 0.2 of the links and reconnecting them between 
random pairs of nodes. 



where, for simplicity, we have assumed that L is a mul- 
tiple of and we write l/L ~ p for comparison with ER 
graphs (this yields the same number of links in either 
case). Indeed, it is interesting to note that the greedy 
connectivity of the lattice is always larger than that of 
an equivalent ER graph. The lattice architecture guar- 
antees that any two sites are connected, yet the typical 
distances are order N, rather than In A^, as in ER graphs. 
The benefits seem to get the upper hand. 

To achieve the small-world effect, a fraction e of the 
links are removed and are then reconnected between ran- 
domly selected pairs of nodes (but avoiding multiple con- 
nections between any pair), see Fig. 3b. Even a small 
fraction e of randomly rerouted links reduces the typical 
shortest path between nodes, from 0{N) to 0(ln A^). We 
now show that the fraction e can be optimized to attain a 
maximum in the greedy connectivity (in particular, out- 
performing the lattice, for which e = 0). 
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FIG. 4: (Color online) Greedy connectivity of circularly em- 
bedded SW graph, with L = 100, I = 2, and uj = 0.85, as 
a function of the fraction of random links, e. Theoretical 
results (solid curve) are compared to numerical simulations 
(symbols). 

The equation for the P{£, m) of SW networks is 

m — 3 

P(£, m) = pP{£ - 1, 0) + (1 - q2) J2 q^^~^P{£ - 1, k) 

k=l 

+ (1 - q'q)q^'"-HP{e ~ 1, m - 2) + q'qP{e - 1, m - 1)}, 

(8) 

where we have specialized to the case oi I — 2. For links 
spanning nodes more than I = 2 lattice spacings apart the 
equation is the same as for ER graphs, with p = el/L, 
now the effective probability of random long-range links. 
The only difference is when the first greedy step is to a 
site within I spacings; these require I specialized terms 
(the last two terms, in our case) because the probability 
of such short-range links is p' = 1 — e+p (and q' — 1 —p'), 
rather than p. Eq. (8) is valid for to > 3. The boundary 
conditions are revised, for the very same reason: 

Pi£, 1) - Si^ip', P{1, 2) = 5,,ip' + 5,,2{l - q'q)p'q' . 

Eq. (8) can be solved by standard techniques. The 
final expression we obtain for GC{u}) is too cumbersome 
to list here, but it agrees perfectly well with numerical 
simulations, as shown in Fig. (4) for one typical case. 
Note the maximum in GC, about e « 0.2, which is nearly 
twice as large as the GC of the corresponding lattice, at 
e = 0, and about 7 times as large as the corresponding 
ER network, at e = 1. Qualitatively similar results are 
obtained for most other link densities, I > 2. For ^ = 1, 
the maximum GC{ijj) occurs always at e = 0, that is, for 
the underlying lattice (a simple ring). 

Our third and last example is that of circularly embed- 
ded scale-free (SF) networks. As usual, the N = 2L + 1 
nodes are to be placed on a ring, slightly perturbed from 
their lattice locations, and we start with a single node 
at (0, 0). We then construct a scale-free net according to 
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FIG. 5: (Color online) Greedy connectivity of circularly em- 
bedded SF nets (with L = 100 and uj = 0.85) as a function of 
their degree distribution exponent, 7 = 1 + 1/r. The GC of 

equivalent ER graphs (- • -) and SW graphs ( ) is shown 

for comparison. 

the redirection algorithm of Krapivsky and Redner [15]: 
Each new node is brought in (to a random location) and 
is connected to one of the existing nodes, selected ran- 
domly, with probability 1 — r. With probability r, the 
connection is redirected to the ancestor of that node (the 
node it was attached to first, when it was added to the 
net). This yields a graph with scale- free degree distribu- 
tion, P{k) ~ , 7 = 1 -I- 1/r. Because the networks 
built in this way are actually trees, the average degree is 
(fc) = 2, so the procedure has the advantage of keeping a 
constant density of links even as r, or 7, is varied. 

In Fig. 5 we present data culled from computer sim- 
ulations of circularly-embedded SF nets. In the limit of 
r — )■ (7 00) the networks are trees with a narrow 
degree distribution, similar to ER graphs. The GC in 
that limit is equal to that of equivalent ER graphs (with 
the same link density). In the opposite limit of r — )■ 1 
(7 — ?> 2) all the links are redirected to one "super-hub" 



and we get a star graph. It is easy to show that in this 
case GC = (l/2)w^ (for L 1). As 7 decreases from 00 
to 2 the GC of the SF networks increases monotonically, 
the largest increase occurring between 3 > 7 > 2, cor- 
responding to the regime encountered in most frequent 
applications [1]. Note that SF networks with 7 < 2.5 ex- 
hibit a greater GC than that of the optimal correspond- 
ing SW net (in this case, of ^ = 1. a simple ring, or e = 0). 

In summary, we have introduced a measure of greedy 
connectivity for geographical networks (graphs embed- 
ded in space) and where the search for a connecting path 
might rely only on local information, such as a node's 
location and that of its neighbors (the ones linked to it). 
This is useful in a host of situations where the networks 
are large and complex and global information is not avail- 
able, or relying on it is impractical due to the network's 
size. Greedy connectivity is larger the larger the fraction 
of connected nodes. 

Greedy connectivity generalizes the Kleinberg naviga- 
tion problem (by which it is inspired) in several ways, 
most importantly, in that nothing is presumed about the 
network structure; the existence of a greedy path between 
any two nodes is not required, and the probability of 
transmission across any given link, uj, now plays a defin- 
ing role. Indeed, Kleinberg-like greedy paths, of minimal 
length, can be found for any geographically embedded 
network by maximizing CC{uj) in the limit of w (or 
IJL — 00). 

An important feature, suggested by the examples ana- 
lyzed herein, is that greedy connectivity can be enhanced 
and optimized by varying the network architecture, in- 
cluding the geographical placement of the nodes. This is 
perhaps the richest venue for future applications. 
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